RTCGAToolbox: A New Tool for Exporting TCGA Firehose Data

Author

  • Mehmet Kemal Samur
Abstract

BACKGROUND & OBJECTIVE: Managing data from large-scale projects such as The Cancer Genome Atlas (TCGA) for further analysis is an important and time-consuming step in research projects. Several efforts, such as the Firehose project, make pre-processed TCGA data publicly available via web services and data portals, but this data must still be downloaded, managed, and prepared for subsequent steps. We have developed an open-source, extensible, R-based data client for pre-processed data from Firehose, and we demonstrate its use with sample case studies. Results show that RTCGAToolbox can facilitate data management for researchers interested in working with TCGA data. RTCGAToolbox can also be integrated with other analysis pipelines for further data processing. AVAILABILITY AND IMPLEMENTATION: RTCGAToolbox is open source and licensed under the GNU General Public License Version 2.0. All documentation and source code for RTCGAToolbox are freely available at http://mksamur.github.io/RTCGAToolbox/ for Linux and Mac OS X operating systems.

Related resources

TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages

Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as...

Extending TCGA queries to automatically identify analogous genomic data from dbGaP

Data sharing is critical to advance genomic research by reducing the demand to collect new data by reusing and combining existing data and by promoting reproducible research. The Cancer Genome Atlas (TCGA) is a popular resource for individual-level genotype-phenotype cancer related data. The Database of Genotypes and Phenotypes (dbGaP) contains many datasets similar to those in TCGA. We have cr...

TCGA-Assembler 2: Software Pipeline for Retrieval and Processing of TCGA/CPTAC Data.

Motivation The Cancer Genome Atlas (TCGA) program has produced huge amounts of cancer genomics data providing unprecedented opportunities for research. In 2014, we developed TCGA-Assembler (Zhu et al., 2014), a software pipeline for retrieval and processing of public TCGA data. In 2016, TCGA data were transferred from the TCGA data portal to the Genomic Data Commons (GDC), which is supported by...

Oil, Government’s Budget and Economic Growth: A Dynamic Panel Data Model for Selected Oil Exporting Economies

Recognition of economic growth determinants is one of the most important concerns for economists. In the oil exporting countries oil revenues play a significant role for the economy alongside with other economic growth determinants. This paper attempts to investigate the role of oil in selected oil-revenue dependent economies. Since oil revenue goes directly to public treasury and is expended b...

The Effect of the Origin of Oil Price Shocks on Macroeconomic Dynamics in an Oil-Exporting Country: An Open DSGE Model

In recent years, some research has focused on the importance of the origin of an oil shock for macroeconomic dynamics in both oil-exporting and importing countries. The existing literature lacks a proper open Stochastic Dynamic General Equilibrium (DSGE) framework to investigate the effect of the origins of oil shocks on macro variables in a two-country model consisting of an oil-exporting coun...

Journal title:

Volume 9, Issue

Pages -

Publication date: 2014